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ABSTRACT 

We discuss the classic theorem according to which a gravitational lens always produces at 
least one image with a magnification greater than unity. This theorem seems to contradict the 
conservation of total flux from a lensed source. The standard solution to this paradox is based 
on the exact definition of the reference 'unlensed' situation, in which the lens mass can either 
be removed or smoothly redistributed. 

We calculate magnifications and amplifications (in photon number and energy flux den- 
sity) for general lensing scenarios not limited to regions close to the optical axis. In this way 
the formalism is naturally extended from tangential planes for the source and lensed images to 
complete spheres. We derive the lensing potential theory on the sphere and find that the Pois- 
son equation is modified by an additional source term that is related to the mean density and 
to the Newtonian potential at the positions of observer and source. This new term generally 
reduces the magnification, to below unity far from the optical axis, and ensures conservation 
of the total photon number received on a sphere around the source. 

This discussion does not affect the validity of the focusing theorem, in which the unlensed 
situation is defined to have an unchanged affine distance between source and observer. The 
focusing theorem does not contradict flux conservation, because the mean total magnification 
(or amplification) directly corresponds to different areas of the source (or observer) sphere in 
the lensed and unlensed situation. We argue that a constant affine distance does not define an 
astronomically meaningful reference. 

By exchanging source and observer, we confirm that magnification and amplification dif- 
fer according to Etherington's reciprocity law, so that surface brightness is no longer strictly 
conserved. At this level we also have to distinguish between different surface brightness defi- 
nitions that are based on photon number, photon flux, and energy flux. 

Key words: Gravitational lensing - Cosmology: miscellaneous 



1 INTRODUCTION 

Already lEinsteiij (Il936h calculated the magnification of a back- 
ground star lensed by a foreground compact mass and found the 
classic result for the total magnification of both images, which de- 
creases for large impact angles (far away from the optical axis) but 
always stays above unity. 

At that time, gravitational lensing was regarded as a theoret- 
ical curiosity with very little chance of leading to observable ef- 
fects. Much later, after the first actual lens systems had been found, 
ISchneideil ^1984) presented and proved a theorem stating that a 
gravitational lens with arbitrary mass distribution will always pro- 
duce at least one image with a magnification greater than 1 . Since 
the theorem is thought to be valid for all configurations of source, 
lens and observer, one might naively think that this necessarily 
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leads to an increase of the total number of photons from the source. 
This would be in contradiction with the fact that lensing can neither 
create nor destroy photons. It was already pointed out bv lSchneideJ 
( Il984h that such an interpretation is not correct. In order to compare 
the lensed with the unlensed situation, it is essential to define the 
latter in a meaningful way, which is far from trivial. In general rel- 
ativity, the lens will slightly distort the geometry of the whole Uni- 
verse, so that it is plainly impossible to add a lens without changing 
the configuration of source and observer. The 'reference situation' 
in the standard explanation is the one in which the mass of the 
lens is redistributed evenly over the Universe. It can be shown that 
the comparison finds no net increase of photon number then (e.g. 
Weinberg 1976). The reason for this is that two differently lensed 
scenarios are compared. 

Some level of uneasiness remained in the minds of several 
authors, because one may think of situations in which the general 
validity of the theorem would still lead to a violation of photon 
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number conservation. lAvni & Shulamil ^m§) calculated the am- 
plification caused by a point-mass lens by dropping the small-angle 
approximation of light rays with respect to the optical axis (de- 
fined by observer and lens), which is generally used in the standard 
formalism. One might think that far from the optical axis the de- 
flection becomes so weak that the full calculation would add only 
higher-order terms which are not relevant in the end. Still, slightly 
surprising, these authors find that the amplification actually sinks 
below 1, which corrects fo r the amplification excess close to the op- 
tical axis. lAvni & Shulam i (1988) worked with a point-mass metric 
in Schwarzschild coordinates, and fixed the radial coordinate rela- 
tive to the lens for the comparison of lensed and unlensed situa- 
tion. ISchneider. Ehlers & Falcol ( Il992h question the value of such 
a comparison. They argue that there is generally no unique way 
to compare mean fluxes in diff erent space-times . Nev ertheless, ap- 
proximating the calculations of lAvni & Shulamil ( Il98 8) for regions 
close to the optical axi s woul d directly lead to the tangential plane 
approach o f ISchneidej ( figsl) . It is not entirely clear why (if at all) 
the theorem loses its validity far from the optical axis. 

Jaroszvhski & Paczvnski (1996) make similar calculations, 
but keep the metric distance between source and observer constant 
for the comparison. They find that the effect of the lens changes 
the total area of the sphere around the source and can thus (without 
contradicting photon conservation) lead to changes in the averaged 
amplification. They also make a distinction between magnification 
and amplification, which becomes relevant at the accuracy levels 
required far from the optical axis. 

None of these publications actually presents the magnification 
to second order (in terms of the deflection potential) for arbitrary 
angles to the optical axifl They either approximate for small an- 
gles or include only first-order terms. One aim of our work is to 
extend the calculations to include second-order terms also for large 
impact angles. Even more important is the discussion of appropriate 
unlensed situations to act as a reference in order to define magnifi- 
cations and amplifications. 

We will also develop the formalism to treat off-axis lenses 
similarly to the standard small-angle approximation and learn 
which part of the proof of the magnification theorem cannot be 
generalised for this situation. This will be done for arbitrary mass 
distributions and not only for point-mass lenses. 

The outline of this paper is as follows. We start in Sec. |2] with 
a brief review of the standard planar lensing theory, which is de- 
scribed in a tangential plane at the position of the lens. This in- 
cludes the magnification theorem and a simple version of its proof. 
In Sec.[3]we explain our approach of discussing the apparent mag- 
nification in terms of solid angles. In this way we can avoid the 
explicit definition of an unlensed reference situation. Solid angle 
must always integrate to 47t, so that the magnification theorem (in 
these terms) cannot hold everywhere. This discussion is equivalent 
to a reference situation in which the area of the source sphere stays 
constant. 

Section|4T|comprises calculations for a point-mass lens at in- 
finity. We explicitly find that the magnification sinks below 1 far 
from the optical axis. General mass distributions (still with source 
at infinity) are discussed in Sec. 14.21 There we develop the poten- 
tial theory of lenses on the sphere as an extension of the standard 
potential theory in the tangential plane. We find that the Poisson 



equation is modified by an additional source term, which ensures 
the validity of Gauss' theorem on the closed sphere. 

The exact form of magnification matrices on the sphere for 
finite (and possibly large) deflection angles is investigated in Ap- 
pendix |B] For our current discussion we only need the result that 
the magnification matrix in terms of second-order derivatives of the 
potential differs from the usual form only to second order in the de- 
flection angle, so that this difference can be neglected. 

In Sec. [5] we extend our calculations to finite values of the 
source distance and explicitly calculate deflection angles and mag- 
nifications for point-mass lenses. By adapting the potential theory 
to this situation, we find that the correction terms in Poisson equa- 
tion and magnification are directly related to the Newtonian poten- 
tial at the source and observer. 

The subject of reciprocity is covered in Sec. |6] where we ex- 
change the role of source and observer to relate magnification with 
amplification. These two are not the same anymore, so that the sur- 
face brightness is no longer conserved if we define it in a standard 
physical sense. This is a direct consequence of the reciprocity theo- 
rem of Etherington ( 1933) and of the space-time modification at the 
position of source and observer, caused by the gravitational field of 
the lens. In anticipation of this result we try to make a clear distinc- 
tion between amplification and magnification from the beginning. 

After a brief discussion of light travel times in Sec.|7]we sum- 
marise and discuss our results in Secs.[8]and|9] 



2 PLANAR LENSING THEORY 
2.1 Basics 

Here we only recall the most relevant as pects. The com plete the- 
ory can be found elsewhere (e.g. Schneid er et alj[l993) . Close to 
an optical axis (which can be defined arbitrarily, conventionally by 
observer and lens centre), the geometry of a gravitational lens sys- 
tem can be projected on the tangential plane of the celestial sphere. 
This leads to the notion of the source plane for the true source struc- 
ture and the lens or image plane for the observed lensed image(s). 
In these planes we define two-dimensional unit-less vectors which 
are based on angular coordinates on the sphere. We now restrict 
the discussion to static, 'thin' lenses, where the light is deflected 
only very close to the lens plane. We can then interpret the action 
of the lens as a potential time-delay, which affects the light when it 
crosses the lens plane. This direct potential delay causes a deflec- 
tion and thus changes the geometrical light path, which creates an 
additional geometrical delay. The total delay At is usually written 
in terms of the Fermat potential, which comprises the geometrical 



and potential parts in a normalized version 



mpr 
if 



At : 



2 

cDa, ' 



(1) 
(2) 



Here the vectors of the observed image and true source position 
are 9 — (Ox, 6y) and Gs, respectively. The distance parameters are 
angular-size distances between observer and lens/source (Da, D^) 
and between lens and observer (Dds). For simplicity, we assume 



^ Close to the optical axis, the second-order terms become dominant for 
compact masses. 



The potential is defined modulo an arbitral^ additive constant. We will 
generally neglect this constant, even if it diverges. Within equations, how- 
ever, consistent constants are used. 
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that the lens is embedded in a global Minkowski metric. In a cos- 
mological situation, Eq. l|2j would have to be scaled by a redshift 
factor 1 + Zi, but remains valid apart from that. 

We define the apparent deflection angle a to fulfil the lens 
equation 



(3) 



Deflection and time-delay are related by Fermat's theorem: Images 
form at stationary points of the time-delay function. These can be 
minima, maxima or saddle-points. Critical images with degenerate 
Hessian matrix (see below, e.g. Einstein rings of point sources) are 
unstable and shall not be discussed here. We thus take the derivative 
of the Fermat potential in Eq. l[T) and substitute the lens equation 
l[3j to find that the deflection angle is the gradient of the potential, 



a(6>) = ViP{e) 



(4) 



If we calculate the potential from the mass distribution, we can 
prove that it obeys a Poisson equation 



V>(6») = 2k(6») 



(5) 



The convergence k is the projected surface mass density E in units 
of the critical density 



47tG DaDa., 



(6) 



In more detail, we can express the Hessian matrix, consisting of the 
second-order derivatives of in terms of the convergence and the 
two-dimensional external shear 7 = (Tx, 7i;): 



H 



t/jyx 4'yy 



K + ^x 7-y 
7y li-'Jx 



(7) 



2.2 The magnification theorem 

This theorem was pres ented by Schneider ( 1984) and is described 
in a broader context by S chneider et al. Here we only want 

to make a plausibility proof, without claiming to be fully mathe- 
matically rigorous. The local magnification matrix M is the inverse 
Jacobian of the lens mapping, Eq. ([3}: 



90s 
90 



1 H 



The scalar magnification fi is the determinant of this matrix, 



:det(1 -H) = 



-7 



(8) 



(9) 



with 7 = j7j. The sign of fj, defines the parity of the image. 

For large G, the Fermat potential will be dominated by the 
first term in Eq. ([T} for all realistic mass distributions, i.e. it grows 
quadratically with 9. Local mass concentrations will produce min- 
ima in tf) that can diverge for point masses, but can not lead to 
positive discontinuous peaks in t/j. It is thus plausible that at least 
one local minimum of (j> exists. This point is a solution of the lens 
equation and corresponds to a real image. Being a minimum, the 
Hessian of the Fermat potential, 1 — H, must be positive definite. 
The eigenvalues can be easily derived to be 1 — k ± 7, which must 
thus both be positive. If we add the condition that the convergence 
or density k is non-negative, we find < fi~^ < 1. In other words, 
this image has positive parity and a magnification /i > 1. As a 
result, the total magnification of all images corresponding to one 
source position is always greater than unity. 

The fact that lensing increases the observed flux density in all 



directionlfl no matter where the lens is positioned, raises the ques- 
tion how this can be consistent with conservation of total flux. This 
issue is discussed bv lSchneidej ( ll984h . lSchneider et al.l l ll992h and 
references therein. The basic argument is that one has to thoroughly 
define the unlensed situation, relative to which the magnifications 
are calculated. Adding a lens modifies the geometry of the Uni- 
verse, so that the comparison is not trivial. This can be illustrated 
by a thought experiment in which we shrink the size of the Universe 
by some factor. This would move all sources closer to all observers, 
so that they would all see an increased flux density. Still, there is of 
course no contradiction with energy or flux conservation at all. 

An important example for total net magnifications is the case 
of a point-mass lens, wher e the calculations were presen ted long 
before the general theorem feinstein^l 19361 : iRefsdalll 1964h . The to- 
tal magnification of the two image|f| as a function of the angular 
separation between lens and source is given by the standard ex- 
pression 



2m 



6,^9? + 4m 

which is greater than unity everywhere. Here we have defined 



(10) 



4GM _D, 



'ds 



DaD, 



4GM 



(for A > Da, Da. « A), 
(11) 



which is the square of the Einstein radius. We can calculate the 
excess magnification by integrating over the tangential plane: 



d 61s (^tc 



1) 



27tm 



(12) 



The order of magnitude of this excess is plausible, because the lens 
produces significant additional magnifications (of the order 1) over 
a region corresponding to the Einstein ring with radius ^/m,. 

We will later find that the solution of the magnification para- 
dox has to do with terms of /i^^ that are linear in the deflection. 
This can be anticipated by the fact that the excess in Eq. M2\ is 
also linear in m. Counting only the terms linear in the deflection 
(linear in ip, k, 7) of Eq. we find 



Hx 



Hy 



2k 



(13) 



The first-order magnification can only be reduced below 1 if either 
fc is negative or the Poisson equation is modified in an appropriate 
way. We do expect the latter to happen when we leave the tangential 
plane and consider the sphere. The divergence (source density) of 
the deflection field V- a = V^V cannot be positive everywhere, 
because of Gauss' law. There simply is no chance for the flux of a 
to escape from the closed sphere, so that the integrated divergence 
must vanish. 



3 SOLID ANGLES 

To avoid most of the complications of arbitrary definitions for ref- 
erence situations, we want to interpret the apparent paradox in a 



^ For the moment we assume the equivalence of magnification and ampli- 
fication, which can be proven in this framework. 

A real Schwarzschild lens would additionally produce an infinite number 
of images very close to the lens (e.g. Virbhadra & Ellis 2000). Something 
similar, although not quantitatively the same, happens with the deflection 
angle according to Eq. j24t . These images are extremely demagnified and 
can be safely neglected here. 
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different way. Instead of discussing fluxes, we want to investigate 
how solid angles are mapped from the observed image sphere to 
the true source sphere, which we put at an infinite distance, where 
the metric perturbation by the lens can be neglected. Using solid 
angles instead of areas on the source sphere, we can avoid a possi- 
ble scaling of the total area. The total solid angle always remains 
4n. 

We do not restrict the discussion to small regions around the 
optical axis (corresponding to the tangential plane) but instead al- 
low for arbitrary impact angles. 

3.1 Conservation of solid angle 

We start by integrating the total apparent solid angle on the image 
sphere and transform this into an integral over the source sphere. 
For this we assume that all lines of sight end somewhere on the 
source sphere and that every part of the source sphere can be seen 
in at least one direction, i.e. we neglect possible horizons which 
would add higher order corrections. 



47t : 



da(6's)M.o.(6's) 



6(6,) 6(6,) 



(0) 



(14) 



(15) 



The sum is taken over all images corresponding to the respective 
source position. We learn that the total magnification, defined in 
terms of solid angle elements, averaged over the source sphere must 
be equal to unity. Even if it is valid close the the optical axis, the 
magnification theorem cannot hold in other regions where the ex- 
cess has to be compensated. 

We conclude that if the magnification theorem would hold in 
this situation, we would have a real paradox that had to be taken 
seriously. We will learn later that the magnification theorem does 
indeed become invalid for large impact angles. In these regions the 
primary image is only very weakly deflected. Any additional im- 
ages will be strongly deflected (invalidating the weak-deflection ap- 
proximation), but highly demagnified. In the following discussion 
we will mostly neglect these images, which allows us to consider 
the magnification as a function of image positions instead of the 
total magnification as a function of source position. Fig. [3] shows 
that this simplification is justified. 

Figure [T] illustrates why the magnification theorem can be 
valid around the optical axis but not everywhere. We see that the 
lens increases the density of lines of sight on the source sphere 
around the (forward) optical axis, corresponding to a magnification 
greater than unity. On the tangential plane, this would be possible 
everywhere, because lines of sight (or solid angle on the sky) can 
be 'borrowed from infinity'. In contrast to this, the total solid angle 
must be conserved on the compact sphere. This necessarily leads to 
magnifications less than unity somewhere, seen in the lower parts 
of the diagram. Note that, for simplicity, a lens without multiple 
imaging is shown here. The lens mass distribution was chosen in 
such a way that the magnifications can be seen already in this two- 
dimensional cross-sectional view. Generally the density of lines of 
sight only shows in the full three-dimensional geometry. 



3.2 Reference situations and the refraction analogue 

We now know that for solid angles, measured for sources at infin- 
ity, the magnification theorem cannot hold. However, we have to 
be very specific about the situations that are compared to define a 




Figure 1. Deflected lines of sight with the observer in the centre and the 
source sphere as circle. The vertical line defines the optical axis, the hatched 
circle shows the lens. The regularly spaced dots on the source sphere label 
the corresponding image positions (straight projections of the lines of sight). 



magnification. The magnification according to Eq. l ll5t defines the 
ratio of solid angles of a cone of light rays measured by the observer 
and at infinity. In astronomy we are interested in the ratio of solid 
angles of one and the same source seen with and without the action 
of the lens. Naively, one would define the unlensed reference situ- 
ation in such a way that the distance between observer and source 
is kept constant, but distances are not uniquely defined in a curved 
space-time so that this approach is not unique. The picture of solid 
angles and a very large source sphere can also be interpreted as a 
reference situation in which the total area of the source sphere is the 
same in the lensed and unlensed situation. Flux conservation then 
demands a mean magnification of unity, so that general validity of 
the magnification theorem cannot be expected. 

We want to postpone the discussion of an appropriate refer- 
ence situation by redefining the problem. It is well-known that the 
gravitational light deflection can be described as the action of a 
refractive medium that fills an Euclidean space, whose coordinates 
are identified with those used in the standard form of the weak-field 
metric equation l ll6t . In this model, the unlensed situation would 
obviously be the one with all distances unchanged but the refrac- 
tive medium removed. Solid angles can then directly be identified 
with areas on the source sphere, so that the reasoning about solid 
angles in Sec. l3.1l inevitablv leads to a failure of the magnification 
theorem in this situation. Since this contradicts the result of Sec.|2] 
even though the formalism is exactly the same, we have to iden- 
tify and abandon the inadmissible assumptions or approximations. 
Once the formalism has been generalised, we can come back to the 
definition of reference situations. 

As an alternative to refraction we can refer to the Newtonian 
picture of the deflection of massive particles moving with the speed 
of light. Modulo a factor of 2, the deflection follows exactly the 
same law as in general relativity, but the geometry of space and 
time are not altered. 



4 SOURCES AT INFINITE DISTANCES 
4.1 Point-mass lens 

We know that close to the optical axis the magnification theorem 
holds. From the discussion of Fig.[Tl we expect that it breaks down 
at larger impact angles. Therefore we have to generalize the usual 
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formalism to allow source positions far from the lens centre. How- 
ever, we will keep the weak-field approximation so that deflection 
angles still have to be small. 

We describe the lens(es) as small perturbation of an asymp- 
totic Minkowski metric instead of a more realistic Robertson- 
Walker metric. This simplifies the treatment without obscuring the 
apparent magnification paradox or its solution. We use the static 
weak-field metric in isotropic coordinates, so that locally observed 
angles (and solid angles) can be derived directly from the spatial 
coordinates r = {x, y, z). In terms of the Newtonian potential 5', 
the line element can be written as 

d.^^(l + f^)cW-(l-5)d.^. (16) 

This leads to the geodesic equation for light in first order approxi- 
mation 

r = -2^Vl* , (17) 

where Vl denotes the transversal part (relative to r) of the gradi- 
ent, and dots stand for derivatives with respect to the affine param- 
eter. Within first order (in ^'), the geometrical length or local time 
can also be used equivalently. 




Figure 2. Geometry of the light deflection with a source at infinity. The 
angles d, 9s and a are positive in our sign convention. Here and in all fol- 
lowing geometrical considerations, we pretend that space is Euclidean (and 
space-time Minkowski) and treat the light deflection as a non-geometrical 
effect. The figures represent the geometry of the coordinates used to define 
the metric in Eq. )16t . Isotropy of the metric ensures that all angles corre- 
spond to real locally observable angles. Straight lines in the figures are not 
necessarily geodesies in real space or space-time. 

4.1.2 Potential 

The deflection angle in Eq. i24l can be written as the derivative of 
a potential tp, which is defined on the sphere as 

'4's>m{d) = m In sin ^ = ^ ln(l — cos 9) + const . (25) 



4.1.1 Deflection angle 

We want to calculate the deflection angle of a point mass M with 
a source at infinity. We restrict ourselves to the plane defined by 
lens, source and observer, which contains the complete light ray. 
The coordinates have their origin at the observer, z is measured 
along the unperturbed line of sight, x in the perpendicular direction 
(Fig. [2I1. The deflection angle is defined according to the standard 
form of the lens equation The lens, located at 



xo — —Dd sin 6 , zo — Dd cos ( 
produces a Newtonian potential of 

'if{x, z) = 



GM 



(18) 



(19) 



- Xo 



Iz — Zo) 



The deflection angle is calculated by integrating Eq. l ll7b along the 
unperturbed line of sight. It is defined to be positive for deflection 
in the negative z-direction. We use z as affine parameter: 



a = — Ai : 



dz x{x — 0, z) 



— / dz\7±<S'{x^0,z) 
c Jo 

2GM r 2^. .2 
^3— Xo / az ^Xo + (z - zo ) 



-3/2 



2GM l + zo{xo + Zo) 

c2 Xo 

m 1 + cos 9 m 6 

— — — cot - 

2 Di sine 2 2 



2\-l/2 



(20) 
(21) 
(22) 

(23) 
(24) 



In the limit of small 6, i.e. close to the optical axis, we recover the 
standard result a — m/9, with the definition of m from Eq. i ll It . 

Note that for large 6 the deflection is not confined to regions 
small compared to Dd, but takes place over a range ~ |2;()|. This 
means that not even the point-mass lens can be considered as thin 
anymore. 



4.1.3 Magniflcation 

Consider as source a thin concentric annulus around the lens, as 
seen by the observer. With radius and width AOs, the solid angle 
is AQ.S = 27Tsin6's d^s. This annulus will be observed at radius 
9 with width d^ and solid angle AO, — 2n sin 8 AO. We separate 
the magnification into the tangential and radial parts, Adj and Mr, 
respectively: 



IJ- = 
Mf ^ = 



Afls 
djT 






sin ( 


% 


sin(^ — a) 


sin 1 


9 


sin 9 


AOs 
'AO 




da 
~ AB 


1 + 


m 
T 


. -2 

sm - 



(26) 

— cos a — cot 9 sin a (27) 
(28) 
(29) 



The magnification we get from Eqs. ( l26H28b with the deflection 
angle from Eq. J24t is plotted in Fig. [3] The deflection angle and its 
derivative are correct only to first order in m, so that the first order 
is also sufficient for the radial and tangential magnification^. Up 
to this order, we find 



Mf ^ = 1 - 



-^ = 1 



a cot ( 
m 

2" ^ ' 



4 (1- cos 0)2 



(30) 
(31) 



For large 9, the m/2 term in Eq. Olt is dominant, whilst in the 
normal lensing regime, the term becomes more important. Ex- 



This is not true for the scalar magnification (the determinant of the 
magnification matrix), where the second order terms, resulting from combi- 
nations of first order terms in Mj and A/r, are responsible for the mag- 
nifications in the classical lensing regime close to the optical axis. It is 
ironic that our generalisation of the standai'd formalism introduces addi- 
tional terms of first order to fi, whilst the dominating classical part is of 
second order 
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Figure 3. Magnification of both images (positive and negative parity) and the total magnification as a function of the source position for a lens of m = 
— 2.35 ■ 10~^^ . For small angles (left), the magnification is indistinguishable from the classic result. Far from the optical axis (light), the magnification 
approaches I — m/2 instead of 1. The magnification theorem does not hold in this region. The total magnification and the magnification of only the positive 
parity image are indistinguishable for very large impact angles. This justifies our approach of not discussing the total magnification of all images but only that 
of the primary image (/i+). 



panded in powers of 6, we find 



to find 



= 1 + 



The corrections in the small-angle regime (last term) are not rel- 
evant in our context. The isotropic correction m/2, on the other 
hand, leads to violations of the magnification theorem. Far away 
from the optical axis, the magnification sinks below unity. The 
transition region with a magnification of 1 is around 6 = ■\/2m, 
approximately the square root of the Einstein radius. For typical 
galaxy lensing cases, this is a few arc-minutes from the lens, far 
away from the strong-lensing regime but still at a small angle. For 
microlensing, the transition region is even closer. 

The small correction of the order m (10^^" in typical galaxy 
scale lenses) is completely irrelevant for all practical calculations. 
Nevertheless, this term is responsible for the conservation of total 
flux or solid angle. Integrated over the complete celestial sphere, 
it leads to a deficit of 27tm + 0{m^), which exactly compensates 
for the excess found in Eq. il2\ . Because the modification is of first 
order in m, the following discussion will concentrate on first-order 
effects. 



In the tangential plane, this corresponds to the integral 
= ^ [[ d^^' <y{0') In \0 - 0'\ + const 



(36) 



(37) 



Even though this formalism is valid for arbitrary mass distributions, 
it is correct only to first order in p. This means it cannot be used for 
multi-plane strong lensing, where the change of impact parameter 
in one lens plane as a result of the deflection in another plane be- 
comes relevant. 



4.2.2 Poisson equation 

The Poisson equation for the spherical potential ^j) can be derived 
with very little formal calculations. We determine the flux F of the 
deflection field a through a circle of radius 6 around the point mass 
m, corresponding to ct{0) — nmS^iO). This is the product of a 
with the circumference of the circle. 



4.2 General mass distributions on the sphere 

4.2.1 Potential 

Knowing the potential for a point-mass lens from Eq. l l25t . we can 
easily generalize for arbitrary mass distributions, defined by the 
three-dimensional mass density p{r): 

/(0,r') 



(33) 



2G 

r.2 



dV ^ ^ 1 + const (34) 



The direction is denoted by the unit vector 6 with r = Or. We 
separate the radial integration from the tangential part on the sphere 
by introducing the normalized surface mass densit}!^] 



dr r p{r) 



(35) 



° Note that in this radial projection the influence of mass elements at dis- 
tances r does not scale with r but with l/r, see Eqs. )33ll34t . 

For a thin shell around the observer at a certain distance Di, we find 
= {i7iG/c^)Di J dr p{r) = S/Sc as in the standard formahsm. 
Compare with Eq. (6). 



F{e) = a{e) 2nsme = 7tm(l + cos( 



(38) 



In the limit of 6 —* 0, this starts with F{0+) — 2nm and then 
decreases linearly with the enclosed area A of the circle: 



F{e) = 2nm - yA(6») A{d) = 27t(l 



iO) (39) 



We know from Gauss' theorem that this flux equals the integrated 
divergence of the field (= Laplacian of the potential), so that we 
can write the Poisson equation as 



VVpm(0) = 27tm<5'(0)- y 



(40) 



The 5 term is the same as on the tangential plane. The mass acts in 
the usual way as source for the deflection field. However, in contrast 
to the tangential plane, the sphere is closed, so that sources and 
sinks of the field must compensate each other; the field lines cannot 
extend to infinity. Without the second term in Eq. ( I40t . the field 
lines would continue and meet at 6* = 7t where they would form 
an additional singularity of mass — m. Instead, the field 'decays' to 
ensure a vanishing total integrated V^V- 

For an arbitrary mass distribution a, the Poisson equation 
reads 



V^i/)(0) = 2 [<j{e) - a] =: 2k{0) 



(41) 
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We learn that not a itself is the source of the field but the difference 
of a and the mean surface mass density 



a = — * dn(0)o-(0) 
An 



(42) 



In Appendix I All we show how the spherical lensing potential 
can be written directly as integral over the Newtonian potential. It 
is the three-dimensional Poisson equation that leads to the first term 
in Eq. J41t . The second term is related to the Newtonian potential 
at the position of the observer. This tells us that the deviations from 
the standard formalism can be related to local distortions of space- 
time. 



4.2.3 The magnification matrix 

The (inverse) magnification matrix is given by the Jacobian of the 
lens mapping. On the tangential plane, the Jacobian of Eq. l[3j leads 
directly to the Hessian of the potential, see Eq. l[8]l. The situation is 
more complicated on the curved sphere, where finite displacements 
cannot be treated as vectors. In Appendix|B]we calculate the mag- 
nification matrix for arbitrary deflection functions a.{6), including 
large deflection angles. In terms of this section, where the deflec- 
tion angle a is the gradient of a potential ?/>, the exact magnification 
matrix from Eq. l |B9l l can be written in coordinates (|| , -L) parallel 
and perpendicular to the negative deflection angle —a, as 



(43) 



The lower indices of ijj denote the second-order (covariant) deriva- 
tives with respect to the coordinate axes. The corrections in cos a 
and sin a/a are of second order in the deflection a. These terms 
can be set to unity if the magnification matrix is needed only to 
first order. The result is then equivalent to the planar formalism in 
Eqs. ([7}{8j, but with the modified spherical potential. 

For a point mass at a distance 9, the Hessian of t/j is diagonal 
witl{3 



i/'x;x = ij0 cot t 




(44) 



(45) 



cos a — cot sm a > 



This matrix corresponds exactly to Eqs. l l27M29t . Neglecting the 
curvature terms in M^^ corresponds to the linear approximation in 
Eq. ( 1301131b . where only the term linear in a was included for the 
tangential magnification. 



4.2.4 Magnification theorem 

We have now, to first order, derived the magnification matrix as the 
Hessian matrix of a potential, analogously to the standard formal- 
ism. We can thus formally follow the proof for the magnification 
theorem as described in Sec. 12.21 step by step. However, the as- 
sumption of positive (effective) k is no longer true, see Eq. J41b . 



7 



This can be derived from the covariant derivatives in 



nates, Tg.g = Tg 
dj_ = dci>/ sinO. 



e, = T^,4, + sine cos 6lTe, Tg.^ = 

together with the scaling transformation 61 



) coordi- 




Figure 4. The apparent deflection angle a is defined as the difference be- 
tween the apparent and true source position. Changes of the direction of the 
light path will therefore affect a scaled with Dis{z) / Ds- 



so that we cannot infer magnifications greater than unity from the 
signs of the eigenvalues. Even though the density (even in compar- 
ison with the reference situation) is still strictly non-negative, the 
convergence is not, which invalidates one of the assumptions for 
the proof of the theorem. 

Recall that, in order to define a firm foundation for this rea- 
soning, we refered to non-relativistic situations with unperturbed 
geometry in which the deflection is caused either by refraction or 
by Newtonian deflection. If we now return to the relativistic sce- 
nario but define the infinitely large source sphere in such a way that 
the area is the same for the lensed and unlensed situation, we come 
to exactly the same conclusion for that case. 



5 SOURCES AT FINITE DISTANCES 

In the following we allow for sources at finite distances. In this sit- 
uation, the metric on the source sphere is modified by the lens, so 
that the previously used approach is not possible anymore, but as- 
sumptions about the unlensed reference situation have to be made. 
We do this by fixing the coordinate distances, which would also be 
the appropriate approach for the model of refraction or Newtonian 
deflection in an Euclidean metric. Modifications of the results for 
different conventions (like fixed metric distances) will be discussed 
later. 

Main aim of this section is the analysis of lens deflection and 
the potential theory on the sphere as opposed to the planar theory. 
For completeness this has to be generalised to finite distances. Most 
of the discussion of the magnification theorem, on the other hand, is 
based on the case of sources at infinity, because some arbitrariness 
in defining a reference situation can be avoided then. 



5.1 Deflection angle 

If the source is at a finite distance from the observer, the sim- 
ple geometry in Fig. [2] has to be modified. If we want to keep the 
form of the lens equation the apparent deflection angle a is 
no longer defined as the angle between the incoming and outgoing 
light ray, but as the difference between apparent and true source 
position (Fig.|4ll. In the thin-lens approximation, this leads to the 
well-known correction factor of Dds/fs to transform the true to 
the apparent deflection angle. As mentioned earlier, far from the 
optical axis not even the point-mass lens can be considered as be- 
ing thin, which means that Dds is not defined globally. If we in- 
stead consider how a local change of x in Eqs. l l20t and follow- 
ing affects the difference between true and apparent source posi- 
tion, we find that the integrands have to be scaled with the local 
Dii{z) /Ds — {Ds — z)/Ds.. In addition, the integration has to end 
at z = Ds. With the same approach as before (A = z), we find for 
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Figure 5. Definition of the distance parameters _Dd, Ds, -Dds and tlie angles 
8, 6' , and for a compact lens at L. Observer and source are denoted by O 
and S. Recall that, in concordance with our general philosophy, the lengths 
and angles are in our convention defined in an Euclidean metric with the 
same coordinates as in the general metric of Eq. (16). (See caption of Fig.|2]) 



the deflection angle of a point-mass lens 



Jo 

2GM 



dz 



0,z) 



(46) 



Z \ 2 



-3/2 



2GM 



xoDs 



(A -zof + xl 



V4 



+ (A - zoY 



+ 



(47) 

20 (A - Zo) - xl 



V4 + 



_ 2GM As (61) + A cos 9- Di 
^ ~ DiD, sin e ' 

with the coordinate distance between source and mass 



Di,{e) = y/D'i -2AA cos( 

In the limit of small 6 this reduces tcjf] 



AGM 



A ~ A e(A - A) 
A A 



zl 
(48) 

(49) 
(50) 
(51) 



For a source behind the lens, we recover the classical limiting case 
with the appropriate scaling of the apparent deflection angle. If the 
lens is further away than the source (which we may call background 
lensing), the singularity at ^ = vanishes and the deflection gener- 
ally becomes much weaker, just as expected. In this case the maxi- 
mum effect will be reached at some finite 6. 

Note that in contrast to the standard expressions, Eq. ( 149b is 
valid for all combinations of distances. Even Ds ~ A is possible, 
including the limit of self -lensing with Ds = Di. 



5.2 Magnification 

Later we will compare magnification with amplification, which cor- 
responds to an exchange of observer and source, i.e. 6^6' and 
Di <-> Dds (see Fig. [5}. For this purpose, we formulate the magni- 
fication in a form that explicitly shows the symmetries. We use the 
trigonometric relations 



Df = Dd^ + Dd^, - 2DdDds cos < 
Ds sin 6 — Dds sin <j!> , 
"'ds cos ( 



Ds cos e = Di-D, 



(52) 
(53) 
(54) 



* The Heaviside function is defined as &{x) ■ 
otherwise. 



lioTX > Oande(x) = 



to write the tangential magnification Mj following Eq. I l30t for the 
deflection angle in Eq. l |49t as 



, GM ( 1 1 

=^ + — a-a: 



Dd + Dds ^ 2 

— — — tan — 

DdDds 2 



(55) 



The radial magnification can be determined from the derivative of 
the deflection angle, see Eq. J28t . 



-, GM , ^ 
1 + ^ -- 



1_ 

Di 



1 Dd + Dds ^ 2 

t:: tan — 

Dds DdDds 2 



To first order in M, the scalar magnification is given by 



A* 



2GM 



Di 



1 

dZ 



(56) 



(57) 



Note that for these calculations we assumed that the coordi- 
nates of source and observer are the same in the lensed and un- 
lensed situation. For finite Ds, the lens will modify the geometry 
of the source sphere, which makes the definition of the unlensed 
reference geometry somewhat arbitrary. While coordinates can be 
fixed for point-like sources, this is not appropriate for the compo- 
nents of extended sources. For those, we should keep their physical 
size constant to allow for a meaningful comparisorQ 

From the metric in Eq. jI6b we infer that the ratio of area to 
solid angle (on the source sphere) changes isotropically with the 
potential \l/s at the source position: 



d^s 



2*, \ 9 



(58) 



Using this we can define the true area magnification fi^ A) as a func- 
tion of the solid angle magnification /i{n), which we called /j, be- 
fore: 



-1 
M(n) 



dQs 
dfi 



_i _ 1 dAs _ ( 2*s 
^(■^' " D? dJ7 " ' ^ ~ 



) Mro) (59) 



The radial and tangential magnifications Mr and Mj both have to 
be corrected with the same factor, which is the square root of that 
for the scalar magnification, so that 



-1 
r(A) 



r(n) 



£s 
75" 



Dds 



We now apply this correction to the magnifications from Eqs. | |551 - 



r(i) 



GM 



A 



Dd + Dds ^ 2 
— „ „ — tan — 
DdDds 2 



, , -1 , ^ GM ( 1 , Dd + Dds , 2 . 
^^{A) = H — I — ~l tan -r 



Dd 



DdDds 



The first-order scalar magnification is 



-1 

M(A) 



2GM 1 



Dd 



(61) 
(62) 

(63) 



All these magnifications refer to an unlensed situation with the 
same coordinate distance to the source. As alternatives we discuss 
fixed affine and metric distances in Appendix Icl 



^ Note that lAvni & Shulamil jl988h . who followed light rays from the 
source to the observer, took the changed geometry into account on the side 
of the source but not on the side of the observer Even though they work in 
Schwarzschild coordinates, where the suiface of a sphere with a constant 
radial coordinate r is always 47tr^, this should not be neglected, since the 
comparison situation is a sphere around the source and not around the lens. 
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5.3 Potential and Poisson equation 

The potential can be derived by integrating the point-mass deflec- 
tion angle in Eq. l l49t . or alternatively by integrating over the ap- 
propriately scaled three-dimensional potential ^, as shown in Ap- 
pendix |A3] with the result in Eq. l lA17b . 

In Appendix I A2I we derive the Poisson equation by integrat- 
ing over the three-dimensional potential derivatives, and find the 
relation 



VV(6>) = 2cr(0) +2 



*o - *s(6>) 



(64) 



where ^'o is the Newtonian potential at the observer. The projected 
surface mass density is now defined as 



47iG 

^2 



A- 
dr r — — — 

A 



■p{r) 



(65) 



The weight function in Eq. l l65t has the shape of a parabola with 
maximum at r = -Ds/2 and zeros at r = and r — D,. How- 
ever, since the apparent size of a mass clump scales with the 
influence of a mass element at r is proportional to 1/r — l/Ds- 

In classical lensing theory, the divergence of the deflection an- 
gle leads to the local surface mass density. Far away from the op- 
tical axis but for sources at infinity, this is changed in the way that 
we obtain in Eq. i4U the density contrast relative to the mean den- 
sity, which results from the gravitational potential at the position of 
the observer (see Appendix lAlt . Now we find in Eq. ( I64t for finite 
Ds that the divergence is also affected by the potential at the source 
position (see Appendix I A2t . These two contributions describe the 
influence of all masses, while the surface mass density only in- 
cludes masses inside of the source sphere. The projected surface 
mass density a vanishes completely for Ds ^ Dd, i.e. in the case 
where the source is closer to the observer than the lens. 

We recall from Sec. l2.2l that the divergence of the deflection is 
directly related to the first-order scalar magnification. The same is 
true here, but the corresponding magnification now refers to solid 
angles instead of area. When writing Eq. i64\ for the point-mass, 
we find that the potential terms lead directly to the correction terms 
in Eq. ^ if we follow the same recipe as in Eq. ( 1131 ). 

As consistency check we should test if the integral over 
Eq. ( I64t vanishes, as required by the compact geometry of the 
sphere. We know from Gauss' theorem that the potential of a ho- 
mogeneous spherical shell is equivalent to that of a point-mass for 
regions outside of the shell, and constant inside. We can turn the 
argument around, to learn that the potential averaged over the sur- 
face of any sphere is the same as that in the centre of the sphere, 
if masses are located only outside of the sphere. These masses do 
not influence the integral of the difference "to — '^s in Eq. i64i . For 
masses inside of the sphere, on the other hand, the averaged poten- 
tial is independent of their location, so that the total mass can be 
thought of as being in the centre: 

< *s > = -G^dfi^ 'drr^^ -fC (66) 

*o = -G^dO ^ 'drr^^ +C (67) 

The contributions from masses outside of the sphere are denoted as 
C. By comparison with Eq. {65}, we find 

< to - *s > = -c^ < o- > . (68) 

As required, the average source density in Eq. ( 164b vanishes. 

Note that for Ds 2> Dd all expression in this section (and in 



Appendix IA2HA3t converge to the ones for infinite Ds derived in 
the previous sections and Appendix I All The distinction between 
and is not necessary in this limit. 



6 RECIPROCITY AND SURFACE BRIGHTNESS 



The magnifications in Eqs. ( 16111631 1 are not invariant under the ex- 
change of source and observer. Writing /.i(^)(o, s) — /ij^j for the 
magnification of the source as seen by the observer and /i(^) (s, o) 
for the magnification of the observer as seen by the source, we find 
(by exchanging Dj ^ Djs) the following relation, which is correct 
up to first-order terms in the tangential and radial magnifications. 



M(A)(S 



2GM 



A'(A)(o,s) 



1_ 



1 
Dds 



— - — (69) 



What does this mean physically? The magnification ^(^j (o, s) de- 
fines the scaling of the apparent size of a magnified source as seen 
by the observer. The reciprocal fj,^A) (s, o), on the other hand, is in- 
versely proportional to the area in the observer plane that is spanned 
by a certain light bundle; that means it defines the amplification in 
terms of the number of photons received from the source by the 
observer per detector area element. 

The ratio of amplification to magnification in Eq. ( I69t pro- 
vides the gravitational change of 'surface brightness', measured as 
photon number density per solid angle. In this sense, gravitational 
lensing does not conserve surface brightness. 

This may seem surprisin g but is in perfect ag reement with the 
reciprocity theorem derived bv' EtheringtonI ( Il933fr°l If one defines 
in an arbitrary space-time for two events x and t/, which are con- 
nected by a null-geodesic, i.e. one can be seen by the other, the 
angular size distances D^y for the distance of y as seen by x and 
vice versa, it can be shown that the two are related by 

Dxy 



1 + Zx 

where z are the redshifts as measured by an arbitrary observer. 

In our situation, the lens magnifications act as corrections to 
obtain the effective angular size distances from Dsi 



1 

D^" 



M(A)(o, s) 



1 



M(A)(s,o) 



(71) 



The gravitational redshifts produced by the field of the lens can be 
derived from the metric, Eq. ( I16t . as 



cdt 
"57 



1-^ 



1 + 



GM 1 



D 



(72) 



1 + z 

for a point mass at a distance D. 

We can confirm Eq. i69\ by taking the ratio of Eqs. fill and 



inserting Eq. ( I70t and the redshifts from Eq. ( I72t : 



M(A)(s,o) _ Dl 

M{A)(0,S) 



l + Z„ 



, 2GM 
1 + ^^ 



/l 



+ ■ 



GM 1 



Dd 



GM 1 



1 + 



D. 



'ds 



1 



1 



(73) 



(74) 



So far, we have discussed the surface brightness defined as 
photon number per solid angle. If we want to determine it in terms 



" see also lEllidj2007h and the mo re easily available republication of the 
original article as lEtheringtonl i2007l) . 
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of energy flux density F, we have to take into account that both the 
energy of the individual photons and the arrival rate at the observer 
are affected by the redshift. This provides another factor of (1 + 
Zo)^/(l + Zs)^, so that the surface brightness in terms of intensity 
scales with 



-Fobs 

"FL" 



1 



1 + 



1 + ' 



4GM 



1_ 



_1 



— - — . (75) 



The scaling of the surface brightness with (1 + z)"* is, of course, 
a well-known fact in cosmology. Here we have exactly the same 
effect, but caused by the metric perturbations of a gravitational lens 
at finite distance. In any case, we should keep in mind that the effect 
is extremely small, typically of the order m or the square of the 
Einstein radius. For typical galaxy-scale lenses with Einstein radii 
of the order ~ 1" this corresponds to ~ lO^^^-lO"^''. 



7 LIGHT TRAVEL TIME 

We separate the light travel time into three parts: The undisturbed 
travel time Ds/c, and the geometrical and potential parts of the 
time-delay, Atgcom and Afpot, respectively. The potential part is 
easy to calculate. We start with the metric from Eq. ( I16t and write 
the (coordinate) time interval for a null-curve along the z-axis as 



df = 1 - -r — 



(76) 



For a point-mass lens, the integral is basically the same as for the 
metric distance shown in Appendix |C2] 

f-Ds 



2GM 



dz 'i'{x = 0, z) 



In 



Dds(6') + Dd- A 



+ const 



(77) 



(78) 



We observe that this is proportional to the deflection poten- 
tial in Eq. i25\ only in the limit Ds oo, where we have 
Afpot = — -Dd V'/c. In the general case there is no proportionality 



to Eq. l lAlTl l. 

In fact, we do not expect a one-to-one relation between Atpot 
and the potential ip as in the case of thin lenses. The argument used 
in Sec. 12. 1 1 to explain why the deflection angle is proportional to 
the gradient of the potential time-delay breaks down, because the 
geometrical part of the time-delay does not keep its simple form 
of Eq. (TJ. It carmot be expressed in terms of image and source 
position alone anymore, because the deflection is not restricted to a 
lens plane (or sphere). In order to determine the geometrical delay, 
we have to know the full three-dimensional mass distribution and 
not just its projection a or the projected deflection potential xp. 

Because it cannot be easily generalized for arbitrary mass dis- 
tributions, we do not present the calculation of Atgeom for a point- 
mass lens. 



8 SUMMARY 

We discuss the classical magnification paradox in gravitational 
lensing in order to better understand if and how a magnification 
greater than unity everywhere can be consistent with global pho- 
ton number conservation. It is of fundamental importance to allow 
for large angles relative to the optical axis. Therefore we develop 
a formalism of lens and source spheres instead of planes. In this 
situation, even a point-mass lens cannot be regarded as a thin lens 



anymore. The 'thickness' of a lens is not defined by the extent of the 
mass distribution but by the extent of the deflecting potential and 
its derivatives. The thickness of a point-mass lens is thus closely 
related to the impact parameter and can be of the same order of 
magnitude as the distances involved. 

To avoid ambiguities, we start by considering a source at infin- 
ity, so that the geometry on the source sphere is not changed by the 
action of the lens, and the area of the source sphere is not modified. 
This makes a comparison with the unlensed situation meaningful. 
We calculate magnifications by mapping solid angle elements from 
the lens sphere to the source sphere. Because we do not include 
horizons around black holes, which are corrections at higher order, 
the total solid angle of the whole sky must be 47t with and without 
lensing. Lensing defines a one-to-many mapping from the source 
sphere to the lens sphere. We see the background sky in all direc- 
tions, no parts of the background are hidden, but some parts may 
be multiply imaged. With this argument, we conclude that the total 
magnification cannot be larger than unity for all positions on the 
source sphere, in contradiction to the magnification theorem. This 
thought is supported by a simple geometric argument. The density 
of deflected lines of sight on the source sphere (or equivalently light 
rays on an observer's sphere around the source) can be increased 
close to the optical axis, because a lens with positive mass has a fo- 
cusing tendency. In the approximation of the tangential plane, this 
can be true everywhere, since lines of sight (and solid angle ele- 
ments) can be borrowed from infinity. Once we consider the com- 
plete sphere, this is no longer possible. Solid angle elements that 
are moved towards the optical axis must be re-moved from other 
parts of the sphere. The magnification cannot be larger than one 
everywhere. For this argument it is essential to work with solid an- 
gles on a source sphere at infinity. With a sphere at finite distance, 
the area density of projected lines of sight can well increase every- 
where, simply by making the area of the sphere smaller. Masses 
modify the geometry, so that mean magnifications above one are 
no longer paradoxical. The further the source sphere moves away, 
the weaker these distortions get. For a finite mass, they decrease 
without limits and can finally be neglected. 

The non-relativistic pictures of refraction and Newtonian de- 
flection in unperturbed geometry support this view. In these scenar- 
ios a paradox cannot be avoided if no corrections to the formalism 
are made. 

With this motivation, we calculate the deflection angle for a 
point-mass lens for arbitrary angles to the optical axis with source 
at infinity and derive magnifications from that. We find an addi- 
tional first-order term that lowers the magnification, violates the 
magnification theorem at some point, and assures total conserva- 
tion of solid angle. In order to understand which part of the clas- 
sical proof of the theorem becomes invalid once we go from tan- 
gential planes to full spheres, we develop the potential theory for 
arbitrary mass distributions on the sphere. As in the plane, we can 
define a two-dimensional lensing potential that is a projection of 
the three-dimensional Newtonian potential. In the same way we de- 
fine a projected surface mass density on the sphere. We find that the 
Poisson equation is modified in an interesting way. In the plane, the 
divergence of the deflection is proportional to the surface mass den- 
sity. This cannot be true on the sphere, because flux of the deflec- 
tion field cannot escape from the sphere. The integrated divergence 
must vanish in order to obey Gauss' law. We do indeed find that 
the divergence is 2((7 — a), the density contrast relative to the aver- 
aged density a. The field lines corresponding to the deflection field 
'decay' with increasing distance to the masses. This fact provides 
a very clean and firm background for a correction of the standard 



© 0000 RAS, MNRAS 000, 000-000 



From planes to spheres 1 1 



1 1 1 










.4 coord ~~ v^ 

A affine ^^ \ 

A metric '\ \ 

P coord \ \ 

P metric \ 
1 1 







1 


T 1 

A coord 




A affine 




A metric " 




P coord 




P metric 







Figure 6. Plots of several first-order magnifications defined in different ways for a point-mass lens. Labels A refer to area magnification, P to photon number 
amplification. Different distances are held constant for the comparison with the unlensed situation: coordinate distance, affine distance and metric distance. 
For 'A coord', we added the second-order magnification calculated for GM j D^j (? = 10~*, which has no noticeable effect for Ds < Di. left: Ds = SDj, 
right: D, = Di/3 



solution of the paradox, according to which the magnification theo- 
rem only holds compared to a reference Universe in which the lens 
is removed. If we instead redistribute the mass of the lens smoothly 
to create a reference situation, the effective density contrast can be- 
come negative, so that validity of the theorem is not expected any- 
more. Using the formalism on the sphere in a perturbed Minkowski 
metric, we directly arrive at the result that it is the density contrast 
relative to the average which creates deflection and magnification. 
This is true even though we use the Minkowski metric with the 
lens removed as our reference situation. We learn that at least in the 
asymptotic Minkowski metric, it does not make a difference if we 
redistribute the mass of the lens over the lens sphere for compar- 
ison, or if we simply remove it. A globally constant surface mass 
density does note lead to light deflection^ 

In the spherical potential formulation, the magnification ma- 
trix can be calculated from the second-order derivatives of the po- 
tential, just as on the plane. This neglects higher-order terms, which 
are not relevant in this context. However, in the Appendix we calcu- 
late exact magnifications for arbitrarily large deflection angles. For 
this the curvature of the sphere has to be taken into account fully. 
The first-order formalism is analogous to standard planar lensing. 
Nevertheless, the magnification theorem becomes invalid, because 
the effective convergence n = a ~a i?, not strictly positive. Posi- 
tivity of K was an essential assumption for the proof, so that we can 
no longer expect the theorem to hold on the sphere. 

To be able to switch the role of source and observer, so that 
magnification becomes amplification, we have to allow sources at 
finite distances. We do this be defining a source sphere with fixed 
coordinate distance from the observer, using the isotropic weak- 
field metric. We calculate the deflection angle of a point mass and 
derive the magnification matrix from that. Here we have to take into 
account that the metric on the source sphere is now modified by the 
gravitation of the lens. We therefore have to distinguish between 
solid angle and corresponding metric area. For the first-order area 
magnification, we find the same result as for the source at infinity. 
As before, the Poisson equation has additional terms to 2o on the 
source side. These are given by the difference of the Newtonian 
potential at the positions of observer and source. In the limit of a 
source at infinity, this corresponds again to the mean surface mass 

This does not contradict IWeinber3 jl976l) . where differences between 
the two cases are discussed. In our work, we co nsider a single, iso lated lens, 
which distorts the geometry only locally, while lWeinberd 1197^) discusses 
a Universe filled with lenses. In the latter case, the mean density and the 
global geometry do depend on the presence of the lenses. 



density. The solid angle magnification can be derived from the po- 
tential in the usual way, but has to be corrected by the potential at 
the source position to convert it into area magnification. 

Alternative reference situations are treated as small perturba- 
tions of this scenario, see Fig. [6] For a fixed metric distance, we 
find that the magnification theorem still fails in some regions. This 
is different if we fix the ajfine distance, in which case the magni- 
fication theorem holds everywh ere as a direct result of the focus- 
ing theorem (e.g. |Perlickll2004) . This is even true for sources at 
infinity. However, the affine distance is a highly inappropriate ref- 
erence for astronomical applications, because it is strongly influ- 
enced by local metric distortions at the observer's position. While 
other, more practical, distance definitions, like the metric distance, 
typically differ from the coordinate distance by the order of the 
Schwarzschild radius of the lens, the affine distance is scaled by 
local perturbations. This is clearly not the reference situation we 
would use in an astronomical scenario, because scaling the dis- 
tances changes the area even at infinity, even though the metric is 
not affected at large distances from the lens. 

An argument supporting this view is provided by the following 
gedankenexperiment (see Appendix|Dj. The lens is a thin spherical 
shell of radius ro around the observer, and the mass is adjusted in 
such a way that a := GA'I / {tqc?) is constant. If a is sufficiently 
small, the fields are weak and all the approximations are valid. For 
symmetry reasons, light rays reaching the observer will not be de- 
flected in this situation. This is consistent with our formalism and 
leads to a magnification in solid angle of exactly unity in all direc- 
tions. In order to come as close as possible to an unperturbed met- 
ric for the Universe, we now make the radius ro arbitrarily small 
(scaling the mass M appropriately). In this limit, the metric pertur- 
bations are confined to an infinitesimal region around the observer, 
and even there they are small. Nevertheless, the focusing theorem 
demands a magnification of (1 — 2a)~^ in all directions, where the 
reference situation is defined to have an unchanged affine distance 
to the source. We can now imagine a practical experiment in which 
we remove the lensing mass shell by shifting it slightly (but 3> ro). 
This corresponds to the 'unlensed comparison situation' as the au- 
thor would define it in an astronomical context. Since the mass of 
the lens is infinitely small, the details of removing the lens are ir- 
relevant for the outcome, so that the reference situation is uniquely 
defined and corresponds to our approach of keeping the coordinate 
(or metric) distance constant. The experiment would find a magni- 
fication of exactly 1, in perfect agreement with our reasoning. The 
ajfine distance for a fixed source, on the other hand, would scale by 
a factor of 1/ \/l — 2a when the lens is put in place. 
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An interesting variant of this gedankenexperiment uses values 
of a that are not necessarily small, so that the weak-field approx- 
imation is no longer valid. This has the invaluable advantage that 
the predicted diverging magnifications can seemingly be defined 
without deciding for a specific reference situation (Schneider, priv. 
comm.). However, we show in Appendix |D] that magnifications in 
this case only diverge for values of a for which the observer would 
be located within the event horizon of the black hole which would 
then be formed by the lens. It is clear that the magnifications in 
this definition do not mean that the observer sees the source larger, 
but are purely related to the definition of the affine distance, which 
scales with the relativistic time dilation at the observer. We con- 
clude that a fixed affine distance does not define the unlensed ref- 
erence situation in a way most astronomers would like. 

Comparing magnification and amplification, we find that the 
two are not equal but are related by Etherington's reciprocity the- 
orem. The ratio of amplification and magnification defines the 
change of surface brightness in terms of photon number, which is 
directly related to the gravitational redshift of the source as seen by 
the observer. Concerning energy flux density, the surface brightness 
is modified even stronger, because photon energy and rate are also 
redshifted. These relations are completely equivalent to the well- 
known relations in cosmology between angular size distance and 
corrected and uncorrected luminosity distance. 

Finally, we find that the deflection potential is no longer pro- 
portional to the potential time-delay, as a result of different scaling 
of deflection and Shapiro delay with distance. Even more disturb- 
ing for practical calculations, the geometrical delay can no longer 
be expressed as a function of image and source position alone. This 
well-known feature of thick lenses also affects point-mass lenses in 
the case of large impact parameters. 

It should be kept in mind that the modifications discussed in 
this article are generally of extremely small magnitude, and it is 
difficult to imagine situations in which they become directly mea- 
surable. Nevertheless, the formalism does have practical applica- 
tions, for example in the case of lenses very close to or behind the 
sources, a case which is not covered by the standard thin-lens ap- 
proximation. 



found that the source function for the deflection is not the surface 
mass density itself, but the density contrast relative to the mean 
density. This result is entirely plausible, because a constant surface 
mass density can, for symmetry reasons, not lead to a light deflec- 
tion. In our formalism, magnifications are defined as directly being 
caused by light deflection and not by the indirect effect of metric 
perturbations at the positions of source or observer. It is thus clear 
that a constant surface mass density cannot lead to magnifications, 
and that the magnification theorem does not hold on the complete 
sphere. 

In contrast to this, a different co mparison situation is explicitly 
define d for the focusing theorem (e.g. ISchneider et alJl992l ; |PalicM 
l2004h . in which the affine distance is kept the same in the lensed 
and unlensed situation. With this definition, the theorem is always 
valid, even under much more general circumstances. However, we 
argue that keeping the affine distance constant is not the appropriate 
way to compare lensed and unlensed sources in an astronomical 
context. 

In addition we learn that at the level of accuracy needed for 
a discussion of the magnification theorem(s), we also have to dis- 
tinguish between magnification and amplification, and that for the 
latter we have to decide if we define it in terms of total photon num- 
ber, photon flux, total energy or energy flux. This distinction is well 
known in the context of cosmological redshifts. As a result of these 
effects, surface brightness is generally not conserved. 
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9 CONCLUSIONS 

The total magnification excess obtained from a naive interpretation 
of the magnification theorem is of the same order of magnitude as 
the perturbation of the metric at the position of observer or source 
caused by the gravitation of the lens. For a discussion of the appar- 
ent paradox, terms of this order cannot be neglected. This means 
that the lensing process can no longer be described in a tangen- 
tial plane, but full spheres have to be used for the lensed images 
and the unlensed source. Extending the planar formalism naturally 
leads to additional terms, which compensate the magnification ex- 
cess and lead to a mean magnification of exactly 1. This is the case 
for an unlensed reference situation in which the source sphere has 
the same area as the lensed source sphere. The fact that total flux 
(or solid angle) is conserved in a formalism which is based on this 
very assumption is clearly tautological. However, without the ad- 
ditional spherical correction terms, this consistency could not have 
been achieved. 

It is a very satisfying result that generalising the formalism in 
order to assure total flux conservation automatically avoids a vi- 
olation of Gauss' theorem for the deflection angle on the sphere, 
by modifying Poisson's equation for the deflection potential. It is 
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APPENDIX A: SPHERICAL POTENTIAL AS 
INTEGRATED THREE-DIMENSIONAL POTENTIAL 

AI Poisson equation for source at infinity 

We define a two-dimensional potential i/) on the sphere as a certain 
projection of the Newtonian potential 



dr — + const 



(Al) 



where we have to expect problems with the convergence and may 
have to subtract diverging terms that are independent of G. If we 
take the angular gradient Ve = rVi of this potential, we recover 
the previous equation ( I21t for the deflection angle. We can there- 
fore omit the tilde and write instead. 

Now we want to calculate the two-dimensional angular Lapla- 
cian of the potential. We have Vg — and thus 



2 r°° 

V|^(6>) = — / drrVi* 
c Jo 



(r) 



(A2) 



The integrand can be calculated by using the three-dimensional 
Laplacian 

1 a2(r*) 



r UT~ 

and the three-dimensional Poisson equation 
V3*(r) = AnG p{r) 

This leads to 



(A3) 



(A4) 



47tG 



/■oo n c 

/ Arrp{r) — 
Jo Jo 



dr 



9^ [r*(r)] 



87tG p , , , 2 
— drrp(r)-- 



*(r) + r 



g^2 



9*(r) 



9r 



(A5) 



(A6) 



We assume that the mass is localized and finite, so that the second 
term vanishes at infinity. We are left with the potential ^'o = '5(0) 
at the position of the observer: 

8nG 



VeV(0) = ^ / drrp{r) + -*„ 
c Jo ^ 



We use 



dV^ 



drr jdQ,{e)p{r) 
drr'pir) , 



*o = -G 

= -47tG 
with the mean density at radius r. 

After inserting this into Eq. l lA7t . we find 

87tG 



Ve>(0) 



dr r [p{r) — p{r)] 



(A7) 

(A8) 
(A9) 
(AlO) 

(All) 

(A 12) 



With the definition for the normalized surface mass density from 
Eq. i35t and the equivalent for a, we finally arrive at the Poisson 
equation | |4U : 



VV(0) = 2 [a(6») - a] 



(A13) 



A2 Poisson equation for source at finite distance 

We can generalize the calculations from last section for finite _Ds. 
For this purpose we have to stop the integration at Ds and radially 
scale the potential in the same way as the deflection angle, compare 
Eq. l|46j with This leads to 



(A14) 



and after integrating in the same way as before, where we now in 
addition to the potential at the observer's position also have a 
contribution from the potential at the source position ^s, to 



V^^l)(e) = 2 



(A 15) 



The normalized surface mass density a is now defined according 
to Eq. l l65t . Please note that the additional potential terms can no 
longer be calculated from the surface mass density alone, because 
they also depend on masses outside of the source sphere, which do 
not contribute to a. 



A3 Potential for point mass at finite distance 



We start with Eq. jA14t and insert the potential for a point mass at 
the position defined in Eq. il8t . 



2GA/ 



az ^=^= 

Va;f, + (z 



(A16) 



zo) 



This integral is divergent at the lower limit, but can be regularised, 
and after some simplifications reduces to 



^{9) = 



2GM 



In [Dj-D, cos9 + Di,{e)] 



+ 



In [A - At cos e + Dd,(e)] - ln(l ~ cos 9) 



(A17) 



APPENDIX B: EXACT MAGNIFICATION MATRIX ON 
THE SPHERE 

We want to calculate the magnification matrix for arbitrary func- 
tions of the deflection angle. This includes large deflections and 
multi-plane lenses, where the deflection angle can no longer be 
written as the gradient of a potential. This generality is not nec- 
essary for the main part of this paper but may serve as the basis for 
future work. 

In the plane, the total displacement is not relevant, so that the 
magnification matrix is determined exclusively by the first-order 
derivatives of the deflection. On the sphere, we have to take into 
account the curvature, and the lens equation is no longer a vector 
equation. 

To determine the source position Os from the image position 
O, we have to move along a geodesic (or great circle) in the direc- 
tion of the negative deflection angle and follow this geodesic for a 
length corresponding to the absolute deflection angle. The geodesic 
equation for arbitrary coordinates is 







(Bl) 



The affine parameter A runs from at to 1 at O,. Derivatives 
with respect to A are written as dots. In the following, we write the 
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deflection angle as (with = a^^a^) to avoid confusion with 
tensor indices. The boundary conditions are 



a;"(0) = e° 



'(0) = -c 



(B2) 



For the magnification matrix, we have to consider additional 
geodesies infinitely close to the reference geodesic. The equation 
for the difference e^", where e is infinitely small, is the differential 
equation for the geodesic deviation: 



DA2 



x' x'^t^ R 



(B3) 



The differential operator D denotes covariant derivatives. The cur- 
vature tensor R has a particularly simple form for two-dimensional 
manifolds. It can be written in terms of the metric p^,^ as 



R 



:= r 



r"/3,^ + Tp^rj^,^ — Tpi^rj^^ (B4) 



1 



(B5) 



The curvature radius K is constant {K = 1) on the sphere. The 
limit of the tangential plane can be found as if — > cxa. Eq. is 
valid in any coordinate system. For our convenience we use the 
system defined by the coordinates at O, which is then parallel- 
transported along the geodesic. In this way, the covariant deriva- 
tives become partial derivatives of the components, and i" = 
(a, 0) as well as the curvature tensor Eq. jB5b have constant com- 
ponents. We use a local Cartesian system (with locally vanishing 
Christoffel symbols) in which ^" is measured parallel to the nega- 
tive deflection angle and f orthogonal to this direction. This leads 
to 

ell n f-L 2fX 1 



K 



(B6) 



With the starting condition 



(B7) 



DA Da;'' 
from the derivative of Eq. l lB2b ). we can easily solve the differential 
equation l lB6t for the two starting vectors (1, 0) and (0, 1) and in 
this way write the transport equation from Q to 6s for arbitrary 
vectors ^'^ as 



1(1) = M-^^(O) 



(B8) 



The inverse magnification matrix of this mapping in (||, _L) coordi- 
nates reads 



/ 1 



(B9) 



in terms of the derivatives (covariant or partial in these coordinates) 
of the deflection function a^. The magnification depends on the 
derivatives, but also on the deflection angle itself, just as expected. 
We notice that finite deflection angles introduce rotation even if the 
deflection field is rotation-free (a!'^ — a^^). Furthermore does the 
curvature of the sphere lead to a magnification of 1/ cos lo in the 
perpendicular direction even for (locally) constant deflection fields. 
In the interpretation of this, one should keep in mind that a covari- 
antly constant deflection does not correspond to a rigid rotation of 
the sphere, not even locally. 

We can decompose the inverse magnification matrix into a ro- 
tated convergence and a shear part. 



M 



(1 



cos (p — sm if 
sin (f cos if 



71 
72 



72 
-71 



(BIO) 



where the parameters are determined by the following equations: 



tan (f — 



1-K- 



a , — a II sm 



uj/u! 



A21 - A12 _ 

All + A22 1 + costj — a"|| — a\ sinuj/uj 
sign(Aii + ^22^ 



(BID 



71 = 



72 = 



All - A2 



V{All+A22)^ + (A12 - A2l)2 

(B12) 

cos (xJ — l+a"|| — sin co/lu 



A12 + A21 a"j_ + a^, sin Lo/uj 



(B13) 



(B14) 



2 2 
Note that the mapping is invariant under a sign change of 1 — k 
with a simultaneous shift of n in ip. Equations dBlIl l and l lBI2t are 
consistent for —n/2 < ip < n/2. 

In the limit of small cu (corresponding to small deflection an- 
gles or if ^ 00), the matrix reduces to the standard form 



1 — fl y X 

-a^ 1 - 



(B15) 



APPENDIX C: ALTERNATIVE DISTANCE MEASURES 
CI Afflne distance 

The so-called /oc«.sOTg theorem (e.g. |Perlic3l2004l) is similar to the 
magnification theorem but is valid for arbitrary space-times, includ- 
ing strong fields and regions far from the optical axis. It says that an 
image created before the first conjugate point (before parity inver- 
sion) is always magnified with a factor \fiA\ 1 compared to an 
unlensed source at the same affine distance. This is different from 
our discussion, because we kept the coordinate distance fixed. 

The affine distance is measured as the affine parameter of a 
null-geodesic connecting two events, where the parameter is nor- 
malized to approach metric distance very close to the observer. To 
compare this with our result, where the magnification can be less 
than unity, we have to calculate the affine distance of the lensed im- 
age. We are only interested in first-order effects in ^, which makes 
this task easy. Deviations of the geodesic from a straight line in co- 
ordinates would be of second order and can be neglected. We thus 
only need one spatial coordinate x measured along the geodesic. 
We use the metric from Eq. l lI6t to define a Lagrange function for 
a geodesic, writing derivatives with respect to the affine parameter 
A as dots. 



' dA 



2* 



2* 



(CI) 



With the Euler-Lagrange equations, we derive to first order x = 
const, and with the normalization at the observer, find 



1 + 



(C2) 



where ^'o is the potential at the observers position. This means that 
the affine distance A is the coordinate distance scaled with the con- 
stant 1/x. To redefine the magnification as a comparison with an 
unlensed source at the same affine distance, we have to scale it with 
(A/i)s)^. For a compact lens of mass M this leads to the first-order 
correction 



flA = fJ-ll 



2*0 



= M 1 



2GM \ 



"2 / 



(C3) 
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For Ds <x this cancels the m/2 term in Eq. OH . which pro- 
duced magnifications less than unity for large 6. The same is true 
for the case with finite Ds in Eqs. l l6H463t . See Fig.|6]for a graphical 
illustration. 



C2 Metric distance 

Alternatively, we could for the comparison of lensed and unlensed 
situation keep the metric distance L constant, which we define as 
measured with a rigid ruler being at rest in the {t,r) coordinates 
of the weak-field metric. This can be integrated easily for a point 
mass at the standard position of Eq. l llSt : 



L = y x/-ds2| 
„ GM 



-0- r^^ 

Jo 



*(a; = 0,z) 



^ GM , 

= -Ds + In 

„ GM , 
= D, + — - In 



\/xl + {z-zof 
D,~Di cos + Di,{9) 



Dd{l - cos9) 
Di,{e)+Di-D, 



(C4) 
(C5) 
(C6) 
(C7) 



To correct the magnifications to be valid for fixed L, they have to 
be multiplied with (L/Df,)^ in the same way as above. Results are 
shown in Fig.|6] 



affine distance can be calculated with methods similar to those de- 
scribed in Appdendix ICll which results in 

r 



A : 



(l + o-)r . 



(D3) 



The magnification defined for an unchanged affine distance scales 
with 



MA 



2a 



(D4) 



which obeys the focusing theorem /ia > 1. 

The critical case a — 1/2 corresponds to a lens which forms a 
black hole, which is clearly beyond the weak-field approximation. 
Even in more moderate cases, we see that the "focusing" is purely 
a result of the local measurement of the affine distance, which is 
affected by the metric distortions. 

This paper has been typeset from a TgX/ MTgX file prepared by the 
author. 



APPENDIX D: LENSING BY A SPHERICAL SHELL 

A very instructive example is that of a thin spherical shell acting as 
a lens, with the observer in the centre of the shell and the source 
sphere at infinity. Due to the symmetry, radial light rays are not 
deflected, so that we do not expect magnification caused by de- 
flection. In the formalism of this paper, see Eq. l l41t . the constant 
surface mass density leads to a constant potential and no deflection 
or magnification. In the formalism of the focusing theorem, on the 
other hand, where the affine distance is kept constant to compare 
the lensed and unlensed situation, we still have focusing without 
deflection. One might think that the case of infinite focusing would 
allow a discussion without the arbitrariness of distance definitions, 
because different distances can only rescale the focusing. 

In order to discuss this case, we have to leave the weak-field 
approximation and use the full metric for a shell with total mass 
M, Rm = GM/(? and radius ro in Schwarzschild coordinates: 

for r < ro 

dr'^ — r^ dJl^ for r > ro 
(Dl) 

We are especially interested in the limiting case of undistorted 
global geometry of the Universe, therefore we consider very small 
ro and M, but keep Rm /ro constant. Using Eq. (|6j it can be shown 
that in the weak field limit we have 

a = , (D2) 

ro 

which we extend as a definition of a for strong fields. In the limit 
of small Rm, the metric is distorted only in a small region around 
the observer. The radial metric distance then corresponds to r. The 
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